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SEQUENCE LISTING 




RECEIVED 



MAR - 1 2002 



(1) GENERAL INFORMATION 



TECH CENTER 1600/2900 



(i) APPLICANT: Lai, Preeti 

Bandman, Olga 



<ii) TITLE OF THE INVENTION: NOVEL HUMAN SODIUM- DEPENDENT 



(iii) NUMBER OF SEQUENCES: 7 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 

(B) STREET: 3174 Porter Drive 

(C) CITY: Palo Alto 

(D) STATE: CA 

(E) COUNTRY: US 
<F) ZIP: 94304 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

<D) SOFTWARE: FastSEQ Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To Be Assigned 

(B) FILING DATE: Filed Herewith 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Billings, Lucy J. 

(B) REGISTRATION NUMBER: 36,749 

(C) REFERENCE /DOCKET NUMBER: PF-0221 US 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415-855-0555 

(B) TELEFAX: 415-845-4166 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: BRAITUT02 

(B) CLONE: 754412 



PHOSPHATE COTRANS PORTER 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Met Gin Val Asp Glu Thr Leu He Pro Arg Lys Val Pro Ser Leu Cys 

15 10 15 

Ser Ala Arg Tyr Gly He Ala Leu Val Leu His Phe Cys Asn Phe Thr 

20 25 30 

Thr He Ala Gin Asn Val He Met Asn He Thr Met Val Ala Met Val 

35 40 45 

Asn Ser Thr Ser Pro Gin Ser Gin Leu Asn Asp Ser Ser Glu Val Leu 

50 55 60 

Pro Val Asp Ser Phe Gly Gly Leu Ser Lys Ala Pro Lys Ser Leu Pro 
65 70 75 80 

Ala Lys Ser Ser He Leu Gly Gly Gin Phe Ala He Trp Glu Arg Trp 

85 90 95 

Gly Pro Pro Gin Glu Arg Ser Arg Leu Cys Ser He Ala Leu Ser Gly 

100 105 110 

Met Leu Leu Gly Cys Phe Thr Ala He Leu He Gly Gly Phe He Ser 

115 120. 125 

Glu Thr Leu Gly Trp Pro Phe Val Phe Tyr He Phe Gly Gly Val Gly 

130 135 140 

Cys Val Cys Cys Leu Leu Trp Phe Val Val He Tyr Asp Asp Pro Val 
145 150 155 160 

Ser Tyr Pro Trp He Ser Thr Ser Glu Lys Glu Tyr He He Ser Ser 

165 170 175 

Leu Lys Gin Gin Val Gly Ser Ser Lys Gin Pro Leu Pro He Lys Ala 

180 185 190 

Met Leu Arg Ser Leu Pro He Trp Ser He Cys Leu Gly Cys Phe Ser 

195 200 205 

His Gin Trp Leu Val Ser Thr Met Val Val Tyr He Pro Thr Tyr He 

210 215 220 

Ser Ser Val Tyr His Val Asn lie Arg Asp Asn Gly Leu Leu Ser Ala 
225 230 235 240 

Leu Pro Phe He Val Ala Trp Val He Gly Met Val Gly Gly Tyr Leu 

245 250 255 

Ala Asp Phe Leu Leu Thr Lys Lys Phe Arg Leu He Thr Val Arg Lys 

260 265 270 

He Ala Thr He Leu Gly Ser Leu Pro Ser Ser Ala Leu lie Val Ser 

275 280 285 

Leu Pro Tyr Leu Asn Ser Gly Tyr He Thr Ala Thr Ala Leu Leu Thr 

290 295 300 

Leu Ser Cys Gly Leu Ser Thr Leu Cys Gin Ser Gly lie Tyr lie Asn 
305 310 315 320 

Val Leu Asp lie Ala Pro Arg Tyr Ser Ser Phe Leu Met Gly Ala Ser 

325 330 335 

Arg Gly Phe Ser Ser lie Ala Pro Val lie Val Pro Thr Val Ser Gly 

340 345 350 

Phe Leu Leu Ser Gin Asp Pro Glu Phe Gly Trp Arg Asn Val Phe Phe 

355 360 365 

Leu Leu Phe Ala Val Asn Leu Leu Gly Leu Leu Phe Tyr Leu He Phe 

370 375 380 

Gly Glu Ala Asp Val Gin Glu Trp Ala Lys Glu Arg Lys Leu Thr Arg 
385 390 395 400 

Leu 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1643 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: BRAITUT02 

(B) CLONE: 754412 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



AGAACGGTGA 


GGATGACCGA 


CGTATAGGCG 


AGAGCCTAGG 


TACGCCATGC 


CAGGTCACCG 


60 


GTCCGGCAAT 


TCCCGGGTCG 


ACCCACGCGT 


CCGCTTGGAG 


GGACGCTGGG 


TTCAACTTGA 


120 


AGCCCTTCCA 


CAGACATTAA 


GTCGGTGAAA 


ACCATTCACT 


AGGAGAGGAG 


AAACACAATG 


180 


GCCACCAAGA 


CAGAGTTGAG 


TCCCACAGCA 


AGGGAGAGCA 


AGAACGCACA 


AGATATGCAA 


240 


GTGGATGAGA 


CACTGATCCC 


CAGGAAAGTT 


CCAAGTTTAT 


GTTCTGCTCG 


CTATGGAATA 


300 


GCCCTCGTCT 


TACATTTCTG 


CAATTTCACA 


ACGATAGCAC 


AAAATGTCAT 


CATGAACATC 


360 


ACCATGGTAG 


CCATGGTCAA 


CAGCACAAGC 


CCTCAATCCC 


AGCTCAATGA 


TTCCTCTGAG 


420 


GTGCTGCCTG 


TTGACTCATT 


TGGTGGCCTA 


AGTAAAGCCC 


CAAAGAGTCT 


TCCTGCAAAG 


480 


TCCTCAATAC 


TTGGGGGTCA 


GTTTGCAATT 


TGGGAAAGGT 


GGGGCCCTCC 


ACAAGAACGA 


540 


AGCAGACTCT 


GCAGCATTGC 


TTTATCAGGA 


ATGTTACTGG 


GATGCTTTAC 


TGCCATCCTC 


600 


ATAGGTGGCT 


TCATTAGTGA 


AACCCTTGGG 


TGGCCCTTTG 


TCTTCTATAT 


CTTTGGAGGT 


660 


GTTGGCTGTG 


TCTGCTGCCT 


TCTCTGGTTT 


GTTGTGATTT 


ATGATGACCC 


CGTTTCCTAT 


720 


CCATGGATAA 


GCACCTCAGA 


AAAAGAATAC 


ATCATATCCT 


CCTTGAAACA 


ACAGGTCGGG 


780 


TCTTCTAAGC 


AGCCTCTTCC 


CATCAAAGCT 


ATGCTCAGAT 


CTCTACCCAT 


TTGGTCCATA 


840 


TGTTTAGGCT 


GTTTCAGCCA 


TCAATGGTTA 


GTTAGCACAA 


TGGTTGTATA 


CATACCAACT 


900 


TACATCAGCT 


CTGTGTACCA 


TGTTAACATC 


AGAGACAATG 


GACTTCTATC 


TGCCCTTCCT 


960 


TTTATTGTTG 


CCTGGGTCAT 


AGGCATGGTG 


GGAGGCTATC 


TGGCAGATTT 


CCTTCTAACC 


1020 


AAAAAGTTTA 


GACTCATCAC 


TGTGAGGAAA 


ATTGCCACAA 


TTTTAGGAAG 


TCTCCCCTCT 


1080 


TCAGCACTCA 


TTGTGTCTCT 


GCCTTACCTC 


AATTCCGGCT 


ATATCACAGC 


AACTGCCTTG 


1140 


CTGACGCTCT 


CTTGCGGATT 


AAGCACATTG 


TGTCAGTCAG 


GGATTTATAT 


CAATGTCTTA 


1200 


GATATTGCTC 


CAAGGTATTC 


CAGTTTTCTC 


ATGGGAGCAT 


CAAGAGGATT 


TTCGAGCATA 


1260 


GCACCTGTCA 


TTGTACCCAC 


TGTCAGCGGA 


TTTCTTCTTA 


GTCAGGACCC 


TGAGTTTGGG 


1320 


TGGAGGAATG 


TCTTCTTCTT 


GCTGTTTGCC 


GTTAACCTGT 


TAGGACTACT 


CTTCTACCTC 


1380 


ATATTTGGAG 


AAGCAGATGT 


CCAAGAATGG 


GCTAAAGAGA 


GAAAACTCAC 


TCGTTTATGA 


1440 


AGTTATCCCA 


CCTTGGATGG 


AAAAGTCATT 


AGGCACCGTA 


TTGCATAAAA 


TAGAAGGCTT 


1500 


CCGTGATGAA 


AATACCAGTG 


AAAAGATTTT 


TTTTTCCTGT 


GGCTCTTTTC 


AATTATGAGA 


1560 


TCAGTTCATT 


ATTTTATTCA 


GACTTTTTTT 


TGAGAGAAAT 


GTAAGATGAA 


TAAAAATTCA 


1620 


AATAAAATGA 


TAACTAAGAA 


TGC 








1643 



(2) INFORMATION FOR SEQ ID NO : 3 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 467 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 450532 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 



Met Gin Met Asp Asn Arg Leu Pro Pro Lys Lys Val Pro Gly Phe Cys 

15 10 15 

Ser Phe Arg Tyr Gly Leu Ser Phe Leu Val His Cys Cys Asn Val lie 

20 25 30 

He Thr Ala Gin Arg Ala Cys Leu Asn Leu Thr Met Val Val Met Val 
35 40 45 
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Asn Ser Thr Asp Pro His Gly Leu Pro Asn Thr Ser Thr Lys Lys Leu 

50 55 60 

Leu Asp Asn lie Lys Asn Pro Met Tyr Asn Trp Ser Pro Asp lie Gin 
65 70 75 80 

Gly He He Leu Ser Ser Thr Ser Tyr Gly Val He He He Gin Val 

85 90 95 

Pro Val Gly Tyr Phe Ser Gly He Tyr Ser Thr Lys Lys Met He Gly 

100 105 110 

Phe Ala Leu Cys Leu Ser Ser Val Leu Ser Leu Leu He Pro Pro Ala 

115 120 125 

Ala Gly He Gly Val Ala Trp Val Val Val Cys Arg Ala Val Gin Gly 

130 135 140 

Ala Ala Gin Gly He Val Ala Thr Ala Gin Phe Glu He Tyr Val Lys 
145 150 155 160 

Trp Ala Pro Pro Leu Glu Arg Gly Arg Leu Thr Ser Met Ser Thr Ser 

165 170 175 

Gly Phe Leu Leu Gly Pro Phe He Val Leu Leu Val Thr Gly Val He 

180 185 190 

Cys Glu Ser Leu Gly Trp Pro Met Val Phe Tyr He Phe Gly Ala Cys 

195 200 205 

Gly Cys Ala Val Cys Leu Leu Trp Phe Val Leu Phe Tyr Asp Asp Pro 

210 215 220 

Lys Asp His Pro Cys He Ser He Ser Glu Lys Glu Tyr He Thr Ser 
225 230 235 240 

Ser Leu Val Gin Gin Val Ser Ser Ser Arg Gin Ser Leu Pro He Lys 

245 250 255 

Ala lie Leu Lys Ser Leu Pro Val Trp Ala He Ser He Gly Ser Phe 

260 265 270 

Thr Phe Phe Trp Ser His Asn He Met Thr Leu Tyr Thr Pro Met Phe 

275 280 285 

He Asn Ser Met Leu His Val Asn He Lys Glu Asn Gly Phe Leu Ser 

290 295 300 

Ser Leu Pro Tyr Leu Phe Ala Trp He Cys Gly Asn Leu Ala Gly Gin 
305 310 315 320 

Leu Ser Asp Phe Phe Leu Thr Arg Asn He Leu Ser Val He Ala Val 

325 330 335 

Arg Lys Leu Phe Thr Ala Ala Gly Phe Leu Leu Pro Ala He Phe Gly 

340 345 350 

Val Cys Leu Pro Tyr Leu Ser Ser Thr Phe Tyr Ser He Val He Phe 

355 360 365 

Leu He Leu Ala Gly Ala Thr Gly Ser Phe Cys Leu Gly Gly Val Phe 

370 375 380 

lie Asn Gly Leu Asp lie Ala Pro Arg Tyr Phe Gly Phe He Lys Ala 
385 390 395 400 

Cys Ser Thr Leu Thr Gly Met lie Gly Gly Leu lie Ala Ser Thr Leu 

405 410 415 

Thr Gly Leu He Leu Lys Gin Asp Pro Glu Ser Ala Trp Phe Lys Thr 

420 425 430 

Phe lie Leu Met Ala Ala He Asn Val Thr Gly Leu lie Phe Tyr Leu 

435 440 445 

lie Val Ala Thr Ala Glu He Gin Asp Trp Ala Lys Glu Lys Gin His 

450 455 460 

Thr Arg Leu 
465 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 560 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 507415 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Glu Phe Arg Gin Glu Glu Phe Arg Lys Leu Ala Gly Arg Ala Leu 

15 10 15 

Gly Arg Leu His Arg Leu Leu Glu Lys Arg Gin Glu Gly Ala Glu Thr 

20 25 30 

Leu Glu Leu Ser Ala Asp Gly Arg Pro Val Thr Thr His Thr Arg Asp 

35 40 45 

Pro Pro Val Val Asp Cys Thr Cys Phe Gly Leu Pro Arg Arg Tyr lie 

50 55 60 

He Ala He Met Ser Gly Leu Gly Phe Cys He Ser Phe Gly He Arg 
65 70 75 80 

Cys Asn Leu Gly Val Ala He Val Ser Met Val Asn Asn Ser Thr Thr 

85 90 95 

His Arg Gly Gly His Val Val Val Gin Lys Ala Gin Phe Asn Trp Asp 

100 105 110 

Pro Glu Thr Val Gly Leu He His Gly Ser Phe Phe Trp Gly Tyr He 

115 120 125 

Val Thr Gin He Pro Gly Gly Phe He Cys Gin Lys Phe Ala Ala Asn 

130 135 140 

Arg Val Phe Gly Phe Ala He Val Ala Thr Ser Thr Leu Asn Met Leu 
145 150 155 160 

He Pro Ser Ala Ala Arg Val His Tyr Gly Cys Val He Phe Val Arg 

165 170 175 

He Leu Gin Gly Leu Val Glu Gly Val Thr Tyr Pro Ala Cys His Gly 

180 185 190 

He Trp Ser Lys Trp Ala Pro Pro Leu Glu Arg Ser Arg Leu Ala Thr 

195 200 205 

Thr Ala Phe Cys Gly Ser Tyr Ala Gly Ala Val Val Ala Met Pro Leu 

210 215 220 

Ala Gly Val Leu Val Gin Tyr Ser Gly Trp Ser Ser Val Phe Tyr Val 
225 230 235 240 

Tyr Gly Ser Phe Gly He Phe Trp Tyr Leu Phe Trp Leu Leu Val Ser 

245 250 255 

Tyr Glu Ser Pro Ala Leu His Pro Ser He Ser Glu Glu Glu Arg Lys 

260 265 270 

Tyr He Glu Asp Ala He Gly Glu Ser Ala Lys Leu Met Asn Pro Val 

275 280 285 

Thr Lys Phe Asn Thr Pro Trp Arg Arg Phe Phe Thr Ser Met Pro Val 

290 295 300 

Tyr Ala He He Val Ala Asn Phe Cys Arg Ser Trp Thr Phe Tyr Leu 
305 310 315 320 

Leu Leu He Ser Gin Pro Ala Tyr Phe Glu Glu Val Phe Gly Phe Glu 

325 330 335 

He Ser Lys Val Gly Leu Val Ser Ala Leu Pro His Leu Val Met Thr 

340 345 350 

He He Val Pro lie Gly Gly Gin He Ala Asp Phe Leu Arg Ser Arg 

355 360 365 

His He Met Ser Thr Thr Asn Val Arg Lys Leu Met Asn Cys Gly Gly 

370 375 380 

Phe Gly Met Glu Ala Thr Leu Leu Leu Val Val Gly Tyr Ser His Ser 
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385 390 395 400 

Lys Gly Val Ala He Ser Phe Leu Val Leu Ala Val Gly Phe Ser Gly 

405 410 415 

Phe Ala He Ser Gly Phe Asn Val Asn His Leu Asp He Ala Pro Arg 

420 425 430 

Tyr Ala Ser He Leu Met Gly He Ser Asn Gly Val Gly Thr Leu Ser 

435 440 445 

Gly Met Val Cys Pro He He Val Gly Ala Met Thr Lys His Lys Thr 

450 455 460 

Arg Glu Glu Trp Gin Tyr Val Phe Leu He Ala Ser Leu Val His Tyr 
465 470 475 480 

Gly Gly Val He Phe Tyr Gly Val Phe Ala Ser Gly Glu Lys Gin Pro 

485 490 495 

Trp Ala Glu Pro Glu Glu Met Ser Glu Glu Lys Cys Gly Phe Val Gly 

500 505 510 

His Asp Gin Leu Ala Gly Ser Asp Glu Ser Glu Met Glu Asp Glu Val 

515 520 525 

Glu Pro Pro Gly Ala Pro Pro Ala Pro Pro Pro Ser Tyr Gly Ala Thr 

530 535 540 

His Ser Thr Val Gin Pro Pro Arg Pro Pro Pro Pro Val Arg Asp Tyr 
545 550 555 560 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 272 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: BRAITUT02 

(B) CLONE: 754412 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATTTATATCA ATGTCTTAGA TATTGCTCCA AGGTATTCCA GTTTTCTCAT GGGAGCATCA 60 

AGAGGATTTT CGAGCATAGC ACCTGTCATT GTACCCACTG TCAGTGGATT TCTTCTTAGT 120 

CAGGACCCTG AGTTTGGGTG GAGGAATGTC TTCTTCTTGC TGTTTGCCGT TAACCTGTTA 180 

GGACTACTCT TCTACCTCAT ATTTGGAGAA GCAGATGTCC AAGAATGGGC TAAAGAGAGA 240 

AAACTCACTC GTTTATGAAG TTATCCCACC TT 272 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: XLR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

CTTGATGCTC CCATGAGAAA ACTGG 25 

(2) INFORMATION FOR SEQ ID NO: 7: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 
' (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: XLF 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 



AGGATTTTCG AGCATAGCAC CTGTC 



